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DETAILED ACTION 
Response to Amendment 

1. In view of the appeal brief filed on 1/17/06, PROSECUTION IS HEREBY 
REOPENED. New grounds of rejection are set forth below. 

To avoid abandonnnent of the application, appellant must exercise one of the 
following two options: 

(1 ) file a reply under 37 CFR 1.111 (if this Office action is non-final) or a reply 
under 37 CFR 1.113 (if this Office action is final); or, 

(2) initiate a new appeal by filing a notice of appeal under 37 CFR 41 .31 followed 
by an appeal brief under 37 CFR 41 .37. The previously paid notice of appeal fee and 
appeal brief fee can be applied to the new appeal. If, however, the appeal fees set forth 
in 37 CFR 41 .20 have been increased since they were previously paid, then appellant 
must pay the difference between the increased fees and the amount previously paid. 

Response to Arguments 

2. Applicant's arguments filed 1/1 7/06 have been fully considered but they are not 
persuasive. 

3. In response to Applicant's arguments, filed 1/17/06, page 9, "White only switches 
to utilizing the remote system when a command is not understood, not "upon finding the 
attention word". The Examiner cannot concur. 

White explicitly states, C.14Jines 2-7, "If local device 14 is not able to respond 
by itself (e.g., it cannot recognize a user's spoken command) or, 
alternatively, if a user triggers local device 14 with a "wake up" command, 
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local device 14 initiates communication with remote system 12, Remote 
...respectively.", in C.1 /.lines 30-32 "Such speech may comprise one or 
more commands in the form of keywords — e.g., " Start, " " Turn on, " or 
simply "On" which are recognizable bv resident VUl 36 of local device ." 

This makes evident that the "wake up" phrases used the local devices 
recognition models, etc. and then switches to the remote devices faculties 
without "not recognizing a user's spoken command" as stated by applicant, 
wherein White explicitly deals with the activation/initiation of a local device, etc. 
Claim Rejections - 35 USC § 102 

4. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(e) the invention was described in (1) an application for patent, published under section 122(b), by 
another filed in the United States before the invention by the applicant for patent or (2) a patent 
granted on an application for patent by another filed in the United States before the invention by the 
applicant for patent, except that an international application filed under the treaty defined in section 
351 (a) shall have the effects for purposes of this subsection of an application filed in the United States 
only If the international application designated the United States and was published under Article 21(2) 
of such treaty in the English language. 

5. Claim 17 is rejected under 35 U.S.C. 102(e) as being anticipated by 

6. Geilhufe et al, (Geilhufe, US 6,584,439) 

As per claim 17, Geilhufe teaches a method of speech recognition comprising: 
searching for an attention word based on a first context including a first set of 

grammar models (C.IS.Iines 47-58-his "Aardvark" as the attention word, the first set of 

grammar model recognizes attention words); and 
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switching upon finding the attention word to a second context to search for an 
open- ended user request , wherein second context includes a second set of models, 
grammar and lexicons (ibid, wherein the open ended request if for the system to "Call 
mom", and encompasses the second context, wherein grammar specific function for 
calling mom is realized, in the new context). 

Claim Rejections - 35 USC § 103 

7. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 1 02 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

8. Claims 1-16 and 26-30, and 32-44 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over Junqua etal (Junqua, 6,324,512 B1) in view of Giuliani et al (Giuliani, 
Hands free Continuous Speech Recognition in Noisy Environment Using a Four 
Microphone Array) and White et al (White, 6,408,272 B1) 

As per claims 1 and 5, Junqua et al teach a natural language interface control 
system for operating a plurality of devices comprising (figure 1): 

" feature extraction module coupled to the first microphone" this signal 
processing component 68, col. 15, lines 53-67); 

"a speech recognition module coupled to the feature extraction module, utilizes 
hidden Markov models; (His speech recognizer 20, col, 2, lines 35-55, Fig. 4); and 

"A device interface coupled to the natural interface module "(His natural 
language parser 26, col. 2, lines 52-61), "wherein the natural language interface module 
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is for operating a plurality of devices coupled to the device interface based upon non- 
prompted, open- ended natural language request from a user*' (his abstract, lines 1-5; 
col. 2, lines 62-67 his unified access controller 30, his digital tuner 40 and his recorder 
44, col. 3, lines 9-17) 

but lacks wherein the natural language interface module abstracts each of the 
plurality of devices into a respective one of the different grammars and a respective one 
of a plurality of lexica corresponding to each of the plurality of devices 

However, Geilhufe teaches an interface module abstracts ...each of the plurality 
of devices (C.17.lines 6-10, C.19.lines 33-37, C. 18. lines 1-4-wherein each device has 
"abstracted", core commands, and commands specific to a given application). 
Therefore, at the time of the invention, it would have been obvious to modify Junqua's 
natural language parser and unified access controller with Geilhufe's device specific 
grammar and lexicon (vocabulary/specific list of commands). The motivation for doing 
so would have been to each device respond to specific commands appropriately 
(CIS. lines 1-4, 47-57-wherein "Aardvark call mom" results in calling mom from a 
desktop phone, by a command definition of call as a specific command to a phone 
device, and not, for example, a transcription of "Aardvark Call mom" into a document) 

It is noted that the Junqua in view of Geilhufe teaches the claimed invention but 
does not explicitly teach a 3 dimensional microphone array. However, this feature is well 
known in the art as evidenced by Giuliani et al who teach a four microphone array. 
Therefore, one of ordinary skill in the art at the time invention was made would have it 
obvious to substitute the microphone taught by Junqua by the array of microphone 
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taught by Giuliani because it would improve the signal quality in a noisy environment 
(see Giuliani page 860). 

It is further noted that the combination teaches the claimed invention but does 
not explicitly teach wherein the speech recognition module can switch between different 
acoustic models and different grammars, wherein at least one of the different acoustic 
models and at least one of the different grammars is downloaded over a network . 
However, this feature is well in the art as evidenced by White et al who teach a 
distributed voice interface system that includes a remote system, which may 
communicate with a number of local devices where data can be downloaded from the 
remote system to the local devices at col. 3, lines 25-32 and col. 16, lines 1-15, and 
teaches having the natural language interface to the speech recognition, C.6.lines 35- 
40-resident on a VUl abstracting a plurality of devices, C.4.lines 55-60, C.5.lines 39-54, 
and C.6.lines 32-55-the natural language through the VUl functions to specific 
information grammars and lexica from remote locations to operate each of a plurality of 
local devices, thereby switching grammars, acoustic models, C.14.lines 2-7, "If local 
device 14 is not able to respond by itself (e.g., it cannot recognize a user's 
spoken command) or, alternatively, if a user triggers local device 14 with a "wake 
up" command, local device 14 initiates communication with remote system 12. 
Remote ...respectively.", in C.17.lines 30-32 "Such speech may comprise one or 
more commands in the form of keywords — e.g., " Start, " " Turn on, " or simply 
"On" which are recognizable bv resident VUl 36 of local device ). Therefore, one 
having ordinary skill in the art at the time the invention was made would have it obvious 
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to incorporate the combination as taught by Junqua, Geilhufe with Giuliani into a 
distributed system as taught by White et a! because the data already present in each 
local device can be updated, replaces or supplemented ms desired to modify the voice 
user interface capability (White et al's col. 3, lines 28-34). 

As per claim 2, the combination teaches the plurality of devices coupled to the 
natural language interface module (Junqua figure 1, his natural language parser 26 and 
his digital tuner 40 and his recorder 44; White his speech recognition engine 40 and 70) 

As per claim 3, Junqua et al wherein the speech recognition module utilizes an 
N-gram grammar (col. 7, line 65 to col. 8, line 2). 

As per claims 4, Junqua et al wherein the natural language interface module 
utilizes a Probabilistic context free grammar ( figure 1 , his natural language parser 26, 
col. 5. lines 5-11). 

As per claims 6-8 (see rejection 1 above) the combination of Junqua, Geilhufe 
(context switching with respect to "attention word") 

As per claims 9 and 10, (see rejection of claim 1), the combination further 
teaches a grammar module for storing different grammars for each of the plurality of 
devices (see Geilhufe, application specific command discussion, also context switching 
as it relates to "attention words") and White switches grammars, .... acoustic models 
from remote sites, upon receipt of a keyword. Therein searching for the non-prompted 
open-ended, natural language requests upon the receipt and recognition of an "attention 
word" or keyword). 
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As per claims 11-16, the combination teaches wherein the device comprises a 
wireless device interface (White, col. 2, lines 55-64, col. 5, lines 39-47),. an external 
network coupled to the natural language interface (Junqua, his internet access 64); 
wherein said 3 dimensional microphone array includes the first microphone ( see 
Giuliani, his four microphone array) 

9. Claims 26-30, and 32-44 are the same in scope and content as claims 1-16 
above and therefore are rejected under the same rationale. 

Conclusion 

1 0. The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 

Diehl et al. (US 6,052,666) teaches a natural language interface that 
abstracts a plurality of devices into respective ones of different grammars 
and lexica according to each of the devices. 

Stanford et al. (US 5,615,296) teaches context switching upon identifying 
an attention word. 

1 1 . Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Lament M. Spooner whose telephone number is 
571/272-7613. The examiner can normally be reached on 8:00 AM - 5:00 PM. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Richemond Dorvil can be reached on 571/272-7602. The fax phone number 
for the organization where this application or proceeding is assigned is 703-872-9306. 
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Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. 

Should you have questions on access to the Private PAIR system, contact the 
Electronic Business Center (EBC) at 866-217-9197 (toll-free). 



A Supervisory Patent Examiner (SPE) has approved of reopening prosecution by 
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